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M. TUBERCULOSIS RNA POLYMERASE ALPHA SUBUNIT 

Field of the Invention 

The present invention relates to novel nucleic acids encoding RNA polymerase 
5 alpha subunit from M. tuberculosis and methods for use thereof. 

Background of the Invention 

The intracellular pathogen Mycobacterium tuberculosis is the causative agent 
of tuberculosis in humans and is responsible for millions of deaths worldwide each year 

10 (Bloom et al. Science 257:1055, 1992). The- emergence of multidrug resistant forms of 
tuberculosis has mandated the development of new antibiotics effective against refractory M. 
tuberculosis strains. Novel drugs which, like rifampin (Vall-Spinosa et al., N. Eng. J. Med. 
283 : 616, 1970), may be capable of inhibiting the prokaryotic transciptional machinery, could 
contribute significantly to the development of therapies to combat M. tuberculosis. 

15 The RNA polymerase of eubacteria consists of four subunits (a, /?, /3\ and o) 

and exists in two major forms: core enzyme (as, /?, £') and holoenzyme (as, /3, 0' plus one 
of several a subunits) (Chamberlin, in RNA Polymerase, Losick et al., eds., Cold Spring 
Harbor, 1976, pp. 17-67). The oc subunit contains determinants for protein-protein 
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interactions with transcription activators and protein-DNA interaction with upstream elements 
(Ishihama. MoLMicrobioL 6:3283, 1992; Russo et al., J. Biol.Chem. 267:14515, 1992; 
Ebright et al., Curr. Opin. Genet. Dev. 5:197, 1995). The amino terminal domain of a is also 
required for assembly of the multisubunit core RNA polymerase (Ishihama, Adw Biophys. 
5 14:1, 1981). The (3 subunit is essential for transcript initiation and elongation, which /?' 
apparently functions in binding of the core enzyme to template DNA (Yura et al., 
Ann.Rev. Genet. 13:59, 1979). Core RNA polymerase is capable of RNA synthesis; however, 
a is required for specific initiation of transcripts at promoter sequences (Gross et al., in 
Transcriptional Regulation, Cold Spring Harbor, 1992, pp. 129-176). 

10 Discovery of inhibitors of M. tuberculosis RNA polymerase is hampered by 

a lack of information concerning components of the M. tuberculosis transcriptional apparatus, 
difficulties in obtaining sufficient yields of active enzymes for biochemical studies, and 
biosafety concerns. Establishment of an in vitro transcription system employing purified and 
reconstituted RNA polymerase would greatly advance efforts to identify new therapeutic 

15 agents active against tuberculosis. 

Accordingly, there is a need in the art for compositions and methods utilizing 
cloned genes and purified proteins derived from M. tuberculosis RNA polymerase. 

Summary of the Invention 

20 The present invention is based on the isolation and characterization of DNA 

encoding, the a subunit of RNA polymerase derived from M. tuberculosis. In one aspect, the 
invention provides a purified, isolated nucleic acid having the sequence shown in Figure 3. 
The invention also encompasses sequence-conservative and function-conservative variants of 
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this sequence. The invention also provides vectors comprising these sequences, and cells 
comprising the vectors. 

In another aspect, the present invention provides a purified, isolated 
polypeptide encoded by the nucleic acid sequence shown in Figure 3, as well as function- 
conservative variants thereof. In one embodiment, the invention provides a purified a subunit 
further comprising a hexahistidine tag. The invention also provides purified, reconstituted 
core- and holoenzyme comprising the a subunit. 

In yet another aspect, the invention provides in vitro methods for high- 
throughput screening to detect inhibitors of M. tuberculosis RNA polymerase. In one 
embodiment, the methods comprise: 

a) providing a mixture comprising 

(i) purified M. tuberculosis RNA polymerase and 

(ii) a DN A template encoding a promoter sequence that is recognized 
by M. tuberculosis RNA polymerase; 

b) incubating the mixture in the presence of test compounds to form test 
samples, and in the absence of test compounds to form control samples, under conditions that 
result in RNA synthesis in the control samples; ... 

c) measuring RNA synthesis directed by said M. tuberculosis-recognized 
promoter in the test and control samples; and 

d) comparing the RNA synthesis detected in step (c) between the test and 
control samples. According to the invention, an RNA polymerase inhibitor is a test 
compound that causes a reduction in RNA synthesis directed by the M. tuberculosis- 
recognized promoter measured in the test sample relative to RNA synthesis measured in the 
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control sample. In a preferred embodiment, the purified RNA polymerase used in practicing 
this method comprises recombinant subunits which are reconstituted to form an enzymatically 
active holoenzyme. In another embodiment, the ability of test compounds to bind to purified 
a subunit is monitored. 

In yet another aspect, the invention provides in vivo methods for high- 
throughput screening to detect inhibitors of M. tuberculosis RNA polymerase. The methods 
are carried out by the steps of: 

a) providing a non-mycobacterial bacterial strain, preferably E. coli, that 

(i) has been transformed with a DNA template encoding a promoter 
sequence that is recognized by M. tuberculosis RNA polymerase, and 

(ii) expresses enzymatically active M. tuberculosis RNA polymerase 0 
(e.g., a 2 , )3, £T plus one of several a subunits); 

b) incubating the bacterial strain of (a) in the presence of test compounds 
to form test samples, and in the absence of test compounds to form control samples; 

c) measuring RNA synthesis directed by said M. tuberculosis-recognized 
promoter in the test and control samples; and 

d) comparing the RNA synthesis directed by said M. tuberculosis- 
recognized promoter detected in step (c) between the test and control samples. According 
to the invention, an RNA polymerase inhibitor is a test compound that causes a reduction in 
RNA synthesis measured in the test sample relative to RNA synthesis measured in the control 
sample.. 

These and other aspects of the present invention will be apparent to those of 
ordinary skill in the art in light of the present specification and appended claims. 
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Brief Description of the Drawings 

Figure 1 is a graphic illustration comparing the deduced amino acid sequence 
of the M. tuberculosis rpoA gene fragment and corresponding regions of a subunits of B. 
subtilis and E. coli. Mt, M. tuberculosis: Bs, B. subtilis: Ec, E. coli. Dashed lines denote 
5 conserved segments of B. subtilis and E. coli proteins for which oligonucleotides were 
designed. 

Figure 2A is a photographic illustration of a Southern blot analysis of rpoA 
hybridizing cosmid clones. Four different positive clones (designated II A5, II A3, ID9, and 
IC5) isolated from an M. tuberculosis cosmid library were digested with Notl and hybridized 
10 with an rpoA gene probe. Sizes of DNA markers (X EcoT141, Amersham) are indicated in 
kb. 

Figure 2B is a photographic illustration of a Southern blot analysis of rpoA 
hybridizing cosmid clones, showing structural conservation of genomic and cosmid-bome 
rpoA sequences. Purified cosmid ICS DNA (lanes 1-5) or M. tuberculosis H37Rv genomic 

15 DNA (lanes 7-11) were digested with the indicated restriction enzymes and hybridized with 
an rpoA probe. Sizes of DNA markers (X EcoT141, Amersham) are indicated in kb. 

Figure 3 shows the nucleotide and deduced amino acid sequences of the At. 
tuberculosis H37Rv rpoA gene. RBS denotes a potential ribosome binding site. The putative 
start and stop codons of the rpoA gene product are at 85-87 and 1126-1128, respectively. 

20 Figure 4 is a graphic illustration comparing the inferred amino acid sequence 

of the M, tuberculosis H37Rv a subunit with sequences of a subunits from other bacteria. 
B. subtilis (Bs), M. genitalium (Mg), E. coli (Ec), H. influenza (Hi), B. pertusis (Bp), S. 
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typhimurium (St), and C. trachomitis (Ct). Shading indicates identical amino acid residues. 
Amino acid sequence alignments were performed using Meg Align (DNAStar). 

Figure 5 is a photographic illustration of an SDS-polyacrylamide gel 
electropherogram depicting the expression and purification of M. tuberculosis a suhunit in 
5 E. coli. Lane 1 shows molecular mass markers (kDa); lanes 2 and 3 show total proteins 
from uninduced and IPTG-induced cultures, respectively, of E. coli harboring pJH37; and 
lane 4 shows purified M. tuberculosis a subunit (approximately 38 kDa), which is indicated 
by an arrow. 

Figure 6 is a photographic illustration of an SDS-polyacrylamide gel 
10 electropherogram showing the reconstitution and purification of core- and holo- RNA 
polymerase from M. tuberculosis. Lanes 1 and 6 show molecular mass markers (kDa); lane 
2 shows a-histidine-containing core enzyme; lane 3 shows a histidine-containing holoenzyme; 
lane 4 shows N-terminal histidine-tagged primary sigma; and lane 5 shows purified E. coli 
holoenzyme. 

15 Figure 7 is a graphic illustration of/;? vitro transcription reactions employing 

reconstituted purified M. tuberculosis RNA polymerase holoenzyme and T101 and T125 
promoter-containing DNA templates. 

Detailed Description of the Invention 

20 All patent applications, patents and literature references cited herein are hereby 

incorporated by reference in their entirety. In the case of inconsistencies the present disclosure 
will prevail. 
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The present invention is based on the isolation of a fragment of the M. 
tuberculosis rpoA gene, encoding the a subunit of RNA polymerase. As described in Example 
1 below, the fragment was isolated by the polymerase chain reaction (PCR) using primers 
directed against regions of a that are highly conserved among Gram-positive and Gram- 
5 negative bacteria. The amplified DNA was utilized as a hybridization probe to recover the 
entire rpoA gene from a cosmid library of genomic DNA from virulent M. tuberculosis strain 
H37RV. Nucleotide sequencing indicated that the 1044 bp M tuberculosis rpoA open reading 
frame (ORF) encodes a protein of 347 amino acids which shows significant structural 
similanix to the a subunits of diverse bacterial species with greatest identity to the B. subtilis 

10 a protein <l ii!urc 1). 

Though structurally conserved throughout most of its length, the inferred amino 
acid sequence of the M tuberculosis a subunit diverged from that of B. subtilis and other 
bacteria at its extreme C-terminus. Interestingly, studies on E. coli RNA polymerase subunits 
have indicated that the C-terminal 94 residues of a function in interactions with certain 

15 transcriptional regulatory proteins, while the N-terminal domain may play a more highly 
conserved role in facilitating the core assembly flshihama. Mol Microbiol 6:3283, 1992; Russo 
et al.. J.liiolChem. 267:14515, 1992). 

The B. subtilis rpoA gene is part of a large operon that is cotranscribed with 
genes for initiation factor 1 and ribosomal proteins B, S13, Sll, and L17: and, unlike the 

20 pattern of gene organization seen in E. coli, the B. subtilis rpoA gene cluster is genetically 
linked to the rpoB-C operon encoding RNA polymerase components (3 and p' (Boylan et al., 
1 Bacterial 171:2553, 1992). 
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In practicing the present invention, many techniques in molecular biology, 
microbiology, recombinant DNA. and protein biochemistry such as these explained fully in, 
for example, Sambrook et aL, 1989, Molecular Cloning: A Laboratory Manual. Second 
Edition. Cold Spring Harbor Laboratory Press. Cold Spring Harbor. New York; DNA Cloning: 
A Practical Approach, Volumes 1 and II, 1985 (D.N. Glover ed.); Oligonucleotide Synthesis, 
1984, (M.L. Gait ed.); Transcription and Translation, 1984 (Hames and Higgins eds.); A 
Practical Guide to Molecular Cloning: the series, Methods in Enzymolog)> (Academic Press, 
Inc.); and Protein Purification: Principles and Practice, Second Edition ( Springer- Verlag, 
N.Y.), may be used. 

The present invention encompasses nucleic acid sequences encoding the a 
subunit of M. tuberculosis RNA polymerase, enzymatically active fragments derived therefrom, 
and related sequences. As used herein, a nucleic acid that is "derived from*' a sequence refers 
to a nucleic acid sequence that corresponds to a region of the sequence, sequences that are 
homologous or complementary to the sequence, and "sequence-conservative variants" and 
"function-conservative variants". Sequence-conservative variants are those in which a change 
of one or more nucleotides in a given codon position results in no alteration in the amino acid 
encoded at that position. Function-conservative variants are those in which a given amino acid 
residue in the a subunit has been changed without altering the overall conformation and 
function of the polypeptide, including, but not limited to, replacement of an amino acid with 
one having similar physico-chemical properties (such as, for example, acidic, basic, 
hydrophobic, and the like). Fragments of the a subunit that retain enzymatic activity can be 
identified according to the methods described herein, such as e.g., expression in E. coli 
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followed by enzymatic assay of the cell extract or by purification and reconstitution of the 
purified polypeptide into an enzyrnatically active holoenzyme. 

The nucleic acids of the present invention include purine- and pyrimidine- 
containing polymers of any length, either polyribonucleotides or polydeoxyribonucleotides or 
5 . mixed polyribo-polydeoxyribo nucleotides. This includes single- and double-stranded 
molecules, i.e., DNA-DNA, DNA-RNA and RNA-RNA hybrids, as well as "protein nucleic 
acids" (PNA) formed by conjugating bases to an amino acid backbone. This also includes 
nucleic acids containing modified bases. The nucleic acids may be isolated directly from cells. 
Alternatively. PCR can be used to produce the nucleic acids of the invention, using either 

10 chemically synthesized strands or genomic material as templates. Primers used for PCR can 
be synthesized using the sequence information provided herein and can further be designed to 
introduce appropriate new restriction sites, if desirable, to facilitate . incorporation into a given 
vector for recombinant expression. 

The nucleic acids of the present invention may be flanked by natural M 

15 tuberculosis regulatory sequences, or may be associated with heterologous sequences, including 
promoters, enhancers, response elements, signal sequences, polyadenylation sequences, introns, 
5*- and 3*- noncoding regions, and the like. The nucleic acids may also be modified by many 
means known in the art. Non-limiting examples of such modifications include methylation, 
"caps", substitution of one or more of the naturally occurring nucleotides with an analog, and 

20 internucleotide modifications such as. for example, those with uncharged linkages (e.g., methyl 
phosphonates, phosphotriesters, phosphoroamidates, carbamates, etc.) and with charged 
linkages (e.g., phosphorothioates, phosphorodithioates. etc.). Nucleic acids may contain one 
or more additional covalently linked moieties, such as. for example, proteins (e.g., nucleases, 
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toxins, antibodies, signal peptides, poly-L-lysine, etc.). intercalators (e.g.. acridine. psoralen, 
etc.). chelators (e.g.. metals, radioactive metals, iron, oxidative metals, etc.), and alkylators. 
The nucleic acid may be derivatized by formation of a methyl or ethyl phosphotriester or an 
alky 1 phosphoramidate linkage. Furthermore, the nucleic acid sequences of the present 
5 invention may also be modified with a label capable of providing a detectable signal, either 
directly or indirectly. Exemplary labels include radioisotopes, fluorescent molecules, biotin, 
and the like. 

The invention also provides nucleic acid vectors comprising the disclosed a 
subunit sequences or derivatives or fragments, thereof. A large number of vectors, including 

10 plasmid and lungal vectors, have been described for replication and/or expression in a variety 
of eukaryotic and prokaryotic hosts. Non-limiting examples include pKK plasmids (Clontech), 
pUC pbsmids. pET plasmids (Novagen, Inc., Madison, WI), or pRSET or pREP (Invitrogen, 
San Diego. CA). and many appropriate host cells, using methods disclosed or cited herein or 
otherwise known to those skilled in the relevant art. Recombinant cloning vectors will often 

15 include one or more replication systems for cloning or expression, one or more markers for 
selection in the host, e.g. antibiotic resistance, and one or more expression cassettes. Suitable 
host cells may be transformed/transfected/infected as appropriate by any suitable method 
including electroporation, CaCl 2 mediated DNA uptake, fungal infection, microinjection, 
microprojectile, or other established methods. 

20 Appropriate host cells include bacteria, archebacteria, fungi, especially yeast, 

and plani and animal cells, especially mammalian cells. Of particular interest are E. coli y B. ) 
subiilis.Saccharomycescerevisiae,SaccharomycescarIsbergensis t Schizosaccharomycespombi, 
SF9 cells. CI 29 cells, 293 cells, Neurospora. and CHO cells, COS cells, HeLa cells, and 
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immortalized mammalian myeloid and lymphoid cell lines. Preferred replication systems 
include M13, ColEL SV40. baculovirus. lambda, adenovirus, and the like. A large number 
of transcription initiation and termination regulatory regions have been isolated and shown to 
be effective in the transcription and translation of heterologous proteins in the various hosts. 
5 Examples of these regions, methods of isolation, manner of manipulation, etc. are known in 
the art. Under appropriate expression conditions, host cells can be used as a source of 
recombinantly produced Mycobacterial-derived peptides and polypeptides. 

Advantageously, vectors may also include a transcription regulatory element 
(i.e.. a promoter) operably linked to the a subunit portion. The promoter may optionally 

10 contain operator portions and/or ribosome binding sites. Non-limiting examples of bacterial 
promoters compatible with E. coli include: trc promoter, p-lactamase (penicillinase) promoter; 
lactose promoter; tryptophan (trp) promoter; arabinose BAD operon promoter; lambda-derived 
PI promoter and N gene ribosome binding site; and the hybrid tac promoter derived from 
sequences of the trp and lac UV5 promoters. Non-limiting examples of yeast promoters 

15 include 3-phosphoglycerate kinase promoter, glyceraldehyde-3-phosphate dehydrogenase 
(GAPDH) promoter, galactokinase (GALlj promoter, galactoepimerase promoter, and alcohol 
dehydrogenase (ADH) promoter. Suitable promoters for mammalian cells include without 
limitation viral promoters such as that from Simian Virus 40 (SV40), Rous sarcoma virus 
(RSV), adenovirus (ADV), and bovine papilloma virus (BPV). Mammalian cells may also 

20 require terminator sequences and poly A addition sequences, and enhancer sequences which 
increase. expression may also be included. Sequences which cause amplification of the gene 
may also be desirable. Furthermore, sequences that facilitate secretion of the recombinant 
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product from cells, including, but not limited to. bacteria, yeast, and animal cells, such as 
secretory signal sequences and/or prohormone pro region sequences, may also be included. 

Nucleic acids encoding wild-type or variant a subunit polypeptides may also 
be introduced into cells by recombination events. For example, such a sequence can be 
5 introduced into a cell, and thereby effect homologous recombination at the site of an 
endogenous gene or a sequence with substantial identity to the gene. Other recombination- 
based methods, such as non-homologous recombinations or deletion of endogenous genes by 
homologous recombination, may also be used. 

a subunit-derived polypeptides according to the present invention, including 
10 function-conservative variants, may be isolated from wild-type or mutant M. tuberculosis cells, 
or from heterologous organisms or cells (including, but not limited to, bacteria, fungi, insect, 
plant, and mammalian cells) into which an a subunit-derived protein-coding sequence has been 
introduced and expressed. Furthermore, the polypeptides may be part of recombinant fusion 
proteins (see, e.g.. Example 2 below). Alternatively, polypeptides may be chemically 
15 synthesized by commercially available automated procedures, including, without limitation, 
exclusive solid phase synthesis, partial solid phase methods, fragment condensation or classical 
solution synthesis. 

"Purification" of an a subunit polypeptide refers to the isolation of the 
polypeptide in a form that allows its enzymatic activity to be measured without interference 
20 by other components of the cell in which the polypeptide is expressed. Methods for 
polypeptide purification are well-known in the art. including, without limitation, preparative 
disc-gel electrophoresis, isoelectric focusing, HPLC, reversed-phase HPLC, gel filtration, ion 
exchange and partition chromatography, and countercurrent distribution. For some purposes, 
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it is preferable to produce the polypeptide in a recombinant system in which the protein 
contains an additional sequence tag that facilitates purification, such as. but not limited to, a 
polyhistidine sequence. The polypeptide can then be purified from a crude lysate of the host 
cell by chromatography on an appropriate solid-phase matrix. Alternatively, antibodies 
5 produced against the a subunit or against peptides derived therefrom can be used as 
purification reagents. Other purification methods are possible. 

The isolated polypeptides may be modified by, for example, phosphorylation, 
sulfation, acylation, or other protein modifications. They may also be modified with a label 
capable of providing a detectable signal, either directly or indirectly, including, but not limited 
10 to, radioisotopes and fluorescent compounds. 

Screening Methods to Identify Anti-tuberculosis Agents 

The methods and compositions of the present invention can be used to identify 

compounds that inhibit the function of M. tuberculosis RNA polymerase and thus are useful 
15 as anti-tuberculosis agents. This is achieved by providing enzymatically active recombinant 

a subunit according to the present invention, in combination with other components of RNA 

polymerase, in a context in which the inhibitory effects of test compounds can be measured. 

Alternatively, the ability of test compounds to bind to purified a subunit is monitored, using 

methods well-known in the art. Test compounds identified using the methods of the invention 
20 encompass those that interfere with the enzymatic activity of RNA polymerase and/or disrupt 

the interaction of the a subunit with transcriptional activators. Thus, the compounds may 

function as general or gene-specific transcriptional inhibitors. 
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In a preferred embodiment, recombinant M. tuberculosis RNA polymerase 
subunits (a, P, plus one of several a subunits) are purified in milligram quantities from E. 
coli cultures by affinity methods utilizing a hexahistidine tagged a subunit. Enzymatically 
active holoenzyme is reconstituted using these components (see. e.g.. Example 2 below). The 
5 active polymerase is then incubated in the presence of test compounds to form test mixtures, 
and in the absence of test compounds to form control mixtures. In vitro transcription is then 
carried out using a DNA template containing appropriate promoter sequences (see, e.g.. 
Example 2 below). 

In another embodiment, M. tuberculosis RNA polymerase subunits (a 2 , P, P' 
10 plus one of several a subunits) are co-expressed in E. coli or another surrogate bacterial cell, 
in conjunction with an appropriate promoter functionally linked to a reporter gene, The ability 
of test compounds to differentially inhibit M. tuberculosis RNA polymerase is then assessed 
by measuring the amount and/or activity of the reporter gene product. . 

M. tuberculosis promoters useful in practicing the invention include without 
15 limitation: PI or P2 hsp 60 promoters (Stover et al., Nature 351 :456, 1991); cpn-60 promoter 
(Kong et al., Proc.NatlAcad.Sci.USA 90:2608, 1993): 85A antigen promoter (Kremer. J. 
Bacteriol. 177:642, 1995); PAN promoter (Murray et al., Mol Microbiol. 6:3331, 1992); 16S 
RNA promoter (Ji et al., Microbiol. 140: 2829, 1994); and askp promoter (Cirillo et al., Mol 
Microbiol. 11:629, 1994). Useful reporter genes include without limitation xylE (Curcic et 
20 al., Mol Microbiol 13:1057, 1994); CAT (Das Gupta et al:, J. Bacteriol 175:5186, 1993); 
luciferase (Cooksey et al., Antimicrob.Agts. Chemother. 37:1348, 1993); green fluorescent 
protein (Dhadayuthap et ah, Mol Microbiol 17:901, 1995); and lacZ. (Silhavy et al., 
MicrobiolRev. 49:398, 1985). 
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It will be understood that the present invention encompasses M. tuberculosis 
RNA polymerases containing any appropriate a factor, which is used in conjunction with a 
particular promoter that is recognized by RNA polymerase containing that a factor. Non- 
limiting examples of useful a factors include a A . o l \ and a' (Doushkan et al.. 1995; DeMaio 
5 et aL Gene 165 :67. 1995). The invention also encompasses the identification of additional 
promoters that are recognized by a particular a subunit.. This is achieved by providing a 
library of random M tuberculosis gene fragments cloned upstream of an appropriate reporter 
gene (see above). The library is transformed into M tuberculosis or M. smegma t is and 
reporter gene expression is measured. Alternatively, the library is transformed into another 
. 10 bacterial cell, such as, e.g., E. coli, which expresses M tuberculosis RNA polymerase core 
subunits as well as a a subunit of interest and connate promoters that drive reporter gene 
expr vision. In yet another embodiment, expression of an M. tuberci 7 osis a factor confers new 
recognition properties on E. coli RNA polymerase and permits isolation of promoters utilized 
specifically by a particular M tuberculosis a subunit. 

15 Preferably, both in vitro and in vivo screening methods to detect M. 

tuberculosis-specific RNA polymerase inhibitors are adapted to a high-throughput format 
allowing a multiplicity of compounds to be tested in a single assay. Such inhibitory 
compounds may be found in, for example, natural product libraries, fermentation libraries 
(encompassing plants and microorganisms), combinatorial libraries, compound files, and 

20 synthetic compound libraries. For example, synthetic compound libraries are commercially 
available. from Maybridge Chemical Co. (Trevillet, Cornwall, UK). Comgenex (Princeton, NJ), 
Brandon Associates (Merrimack, NH), and Microsource (New Milford, CT). A rare chemical 
library is available from Aldrich Chemical Company, Inc. (Milwaukee, WI). Alternatively, 
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libraries of natural compounds in the form of bacterial, fungal, plant and animal extracts are 
available from, for example. Pan Laboratories (Bothell. WA) or MycoSearch (NC), or are 
readily producible. Additionally, natural and synthetically produced libraries and compounds 
are readily modified through conventional chemical, physical, and biochemical means 
(Blondelle et al., TibTech 14:60, 1996). preferably using automated equipment, to allow for 
the simultaneous screening of a multiplicity of test compounds. 

Useful anti-tuberculosis compounds are identified as those test compounds that 
decrease tuberculosis-specific transcription. Once a compound has been identified by the 
methods of the present invention as an RNA polymerase inhibitor, in vivo and in vitro tests 
may be performed to further characterize the nature and mechanism of the inhibitory activity. 
For example, classical enzyme kinetic plots can be used to distinguish, e.g., competitive and 
non-competitive inhibitors. 

Compounds identified as RNA polymerase inhibitors using the methods of the 
present invention may be modified to enhance potency, efficacy, uptake, stability, and 
suitability for use in pharmaceutical formulations, etc. These modifications are achieved and 
tested using methods well-known in the art. 

The present invention is further described in the following examples which are 
intended to further describe the invention without limiting the scope thereof. 
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Example 1: Isolation and Analysis of the M. tuberculosis rpoA Gene 

The following experiments were performed in order to clone the gene encoding 
M. tuberculosis RNA polymerase a subunit. In these experiments, the following materials and 
5 methods were used. 

A. Materials and Methods 

PCR amplification: M tuberculosis H37Rv genomic DNA (1 ng) was amplified 
in a 100 \i\ reaction containing: 10 mM Tris-HCl pH 8.3, 1.5 mM MgCl 2 , 500 mM KC1, 2.5 
U Taq polymerase (Beohringer-Mannheim). 0.2 mM dNTPs and oligonucleotide primers (1 
10 |iM each): 5 ' -CTACGC AAGC AGGGTCCGGGTG AG-3 ' and 5'- 

CTCACCCGGACCCTGCTTGCGTAG-3\ PCR was performed in an Omnigene thermocycler 
with 35 cycles of 94°C for 1 min, 55°C for 2 min, 72°C for 1 min. The amplification product 
was gel-purified and ligated into vector pCR2.1 (Invitrogen). 

Cosmid hybridizations: A transducing lysate of a cosmid library of M. 
15 tuberculosis H37Rv genomic DNA in vector pYA3060 was used. Cosmid-bearing E. coli 
CD2819T (Jacobs et aL, InfecJmmun. 52:101. 1986) colonies representing roughly five genomic 
equivalents were individually picked to wells of sterile 96- well microliter dishes and 
propagated at 30°C in Luria broth containing ampicillin at 30 jig/ml and thymidine at 50 
(ig/ml. Colonies were grown overnight at room temperature on the above media as nylon 
. 20 filter replicas of the library. Filters were processed for colony hybridization by standard 
methods and probe hybridizatjpns performed as described below. Cosmid DNAs were purified 
using maxiprep columns (Qiagen). 
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Southern blot analysis: Restriction enzyme digests of M tuberculosis H37Rv 
chromosomal or cosmid DNAs were resolved by electrophoresis on 1% TAE-agarose gels and 
blotted to nylon filters using standard methods. Probe labeling and hybridizations were 
performed using the Gene Images hybridization system (Amersham) essentially as described 
5 by the supplier except that hybridizations and washes were carried out at 70°C. 

DNA sequencing and analysis: Plasmid templates for nucleotide sequencing 
were purified using RPM miniprep kits (Biol 01). PCR cycle sequencing (ABI Prizm) was 
carried out with an Applied Biosystems automated sequencer. 

10 B. Strategy for M tuberculosis rvoA gene isolation 

A fragment of the M. tuberculosis rpoA gene was isolated by PCR, using 
oligonucleotide primers homologous to defined regions of a that are conserved among Gram- 
positive and Gram-negative bacterial species. Amino acid sequence alignments of the a 
subunits of B, subtilis and E. coli revealed several short regions of identity clustered primarily 

15 within the N-terminal half of a implicated in association with RNA polymerase core^subunits. 
Primers corresponding to conserved a segments LRRILLSS and VTAADITHD (amino acids 
39-46 and 106-114, respectively of the B. subtilis subunit) specifically amplified a DNA 
fragment of anticipated size (228 bp) using M. tuberculosis H37Rv genomic DNA as a 
template. To confirm that the amplified DNA contained rpoA sequences, the 228 bp fragment 

20 was subcloned and a partial nucleotide sequence determined. The deduced amino acid 
sequence of the PCR product showed significant homology to the a subunits of other bacteria 
(Figure 1), confirming its identity as an rpoA gene fragment. One ORF of the 228 bp 
fragment displayed high degree of identity to corresponding intervals of the B. subtilis (61%) 
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and E. coli (55%) a sublimits. The A'/, tuberculosis rpoA -derived sequence had an overall G+C 
content of 60% while the G+C content for bases occupying the third position of each codon 
increased to 76%, values in good agreement with compositions of other M. tuberculosis genes 
(Weiden et al., in Tuberculosis. Little. Brown, 1996. pp. 21 1-222). 

5 

C. Southern blot analysis and isolation of the entire M tuberculosis rpoA gene 

To establish that the cloned 228 bp fragment was mycobacterial in origin, 
Southern blot analysis of restriction enzyme digests of M tuberculosis H37Rv genomic DNA 
was pcrlormcJ. The PCR probe detected unique restriction fragments in chromosomal digests 

10 (Figure 2Hi These data are consistent with a single copy of the rpoA gene in the M. 
tubcr^ t4tn\t\ chromosome (two bands of hybridization observed in the EcoRl digest result from 
the presence of an EcoRl site within the 228 bp probe sequence). The full-length rpoA gene 
was obtained from a cosmid library of M. tuberculosis H37Rv genomic fragments using the 
228 bp PC R probe. Screening of approximately 600 cosmid-bearing E. coli colonies 

15 (representing roughly 5 genome equivalents) with the rpoA gene fragment yielded 4 positive 
clones. Restriction analysis indicated that the 4 isolates differed from one another with respect 
to total insert size, though all shared a roughly 10 kb Not\ fragment which hybridized to the 
rpoA gene probe (Figure 2A). One rpoA -hybridizing cosmid clone, designated ICS, was 
chosen for further analysis. Southern blotting of IC5 DNA digested with a panel of restriction 

20 enzymes confirmed that the no gross structural rearrangements of the rpoA gene had occurred 
during cloning (Figure 2B). The 1.4 kb SacII and 1.3 kb Sail rpoA -hybridizing fragments of 
cosmid 1C5 were subcloned into vector pSKII+ prior to nucleotide sequencing. 
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D. Sequence analysis of the M tuberculosis rpoA gene 

Nucleotide sequencing was performed on plasmid subclones and on cosmid IC5 
DNA. Nucleotide and deduced amino acid sequences of the rpoA gene of M tuberculosis are 
shown in Figure 3. The sequence encodes a 1044 bp ORF which has an overall G+C 
5 composition of 63% (85% for bases occupying the codon third position). Assuming that the 
ATG at position 85-87 serves as the initiator codon, the ORF is expected to encode a protein 
of 347 amino acids. A strong match with the consensus sequence for an M tuberculosis 
ribosome binding site (GAAAGGA), (Novick, in Tuberculosis, Little. Brown, 1996, pp. 187- 
198) is positioned just upstream of the putative ATG codon. Examination of more than 80 
10 bp of nucleotide sequence upstream of the translation start site did not reveal regions of exact 
identity with prokaryotic promoter sites. Among a subunits studied in other bacterial species, 
the deduced amino acid sequence of the 347 residue M. tuberculosis protein displayed greatest 
similarity (48% identity) to the a subunit of the Gram-positive sporulating bacterium B. 
subtilis (Figure 4). 

15 

Example 2: Expression of ML tuberculosis RNA Polymerase cv Subunit and 
Reconstitution of Active M. tuberculosis RNA Polymerase 

The following experiments were performed to purify M. tuberculosis RNA 
20 polymerase a subunit that had been expressed in £. coli and to reconstitute enzymatically 
active RNA polymerase holoenzyme. 
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A. Methods: 

Purification of M. tuberculosis a subwrit: Recombinant a was produced in E. 
coli BL2HDE3) cells (Nova'gen) and grown at 37°C in LB medium containing 25 ug/mi/ 
kanamycin. Overnight cultures (100 nil) were diluted 10-fold and grown in LB medium 
5 containing 25 ng/ml kanamycin until A 600 = 0.6 (approximately 75 min). Expression of a was 
induced for 3 hr with 4 mM IPTG. Purification of the a subunit was performd essentially as 
described by Tang, et. al ? MethEnzymol 273:130, 1996. Briefly, cell pellets were harvested 
by centrifugation for 10 min At 4°C (3000 x g). The pellet was resuspended in 20 ml buffer 
B (6 M guanidine HC1, 10 mM Tris (pH 7.9), 500 mM NaCl, 5 mM imidazole) and lysed by 

10 sonication. Inclusion bodies were removed by centrifugation at 42,000 x g for 30 min at 4°C. 
The sample was then adsorbed onto Ni 2 "-NTA (Qiagen) in buffer B. The sample was washed 
twice with 20 ml buffer B; washed twice with buffer B containing 30 mM imidazole; and 
eluted with 10 ml buffer B containing 500 mM imidazole. Adsorption, washes, and elution 
were performed with 1 min incubations at 4°C with gentle mixing. 

15 Reconstitution of a with other M. tuberculosis subunits to form core/holo- RNA 

polymerase: Reconstitution of core/holo- enzyme was performed by combining 60. 300. and 
600 jag of a, p, and (3\ respectively, followed slow removal of the denaturant by dialysis. The 
combined protein concentration was adjusted to 0.5 mg/ml with buffer B to prevent 
aggregation (Nobuyuki et al., Meth.Enzymol 273:121. 1996) and dialysed against buffer E (50 

20 mM Tris (pH 7.9), 200 mM KC1, 10 mM MgCl 2 , 10 ^M ZnCL. 1 mM EDTA, 5 mM p- 
mercaptoethanol, 20% (v/v) glycerol) overnight at 4°C. Following dialysis, the sample was 
activated by incubation for 45 min at 30°C and aggregates cleared by centrifugation at 10,000 
x g for 10 min at 4°C The sample was adsorbed onto Ni 2 ~-NTA in buffer F (50 mM Tris (pH 
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7.9). 0.5 mM EDTA. 5% (v/v) glycerol), and washed three times in buffer F containing 5 mM 
imidazole, and eluted in buffer F containing 150 mM imidazole. Adsorption and eluiion were 
performed by incubating for 45 min at 4°C and washes for 1 min. Each step was followed by 
ccntrifugation at 16.000 x g for 2 min at 4°C. Core/holo- enzyme was dialysed in buffer F 
5 (50 mM Tris (pH 7.9). 50 mM KC1, 0.5 mM EDTA. 5% (v/v) glycerol) overnight at 4°C. 

Purification ofM. tuberculosis holoenzyme: Core polymerase and subassemblies 
were separated from holoenzyme by elution from a MonoQ column (Pharmacia) using a 0.2 - 
0.5 M KC1 gradient. Fractions containing holoenzyme were pooled and used the in vitro 
transcription assay. 

10 Promoter construction: Recently, mycobacterium promoters have been cloned 

using a plasinid shuttle vector pSD7 (Das Gupta et al., J. BacterioL 175 :5186. 1993) and in 
a later stud\, the strength of these promoters was analyzed (Bashyam et al.. J. BacterioL 
17X:4N4?. 1996). Based on these studies two promoters, T125 and T101, identified as a weak 
and a strong promoter, respectively, were cloned into the pUC19 vector and used as templates 

15 for M tuberculosis transcription asays. 

Transcriptional activity: Holoenzyme was reconstituted by the addition of 
primary sigma factor MysA to the core polymerase and incubation at 30°C for 20 minutes. 
The transcription reaction was performed as described (Shorenstein et al.. 7. BioLChem. 
248:6170. 1973.) except that 3 jag of template (pUC19 ? pMC116, or pMC117) was used per 

20 reaction containing a high salt transcription buffer (50 mM Tris (pH7.9), 10 MgCU. 200 mM 
KCL 10 mM DTT, 0.1 mM EDTA, 1 mM K 2 HPO, (pH7.5). 100 j-ig/ml BSA). 50 ^il reactions 
were incubated for 30 minutes at 37°C. To precipitate the RNA transcripts and to stop the 
reaction. 100 jal of 10% TCA were added to the reaction. The TCA-precipitated RNA was 
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adsorbed onto UniFiller GF/C (Packard. Meriden. CT) double-thick glass fiber filtermats using 
a cell harvester (Packard. Meriden, CT). The wells of the microtiter plate and the filter were 
washed two times with 5% TCA and bound radioactivity was determined using a TopCount- 
HTS (Packard. Meriden, CT) scintillation counter. 

5 

B. Results: 

Recombinant M tuberculosis a subunit was overproduced in soluble form in 
E. coli cells transformed with plasmid pJH37, a derivative of pET26b, containing the entire 
rpoA coding region. M. tuberculosis a was expressed at high levels as a C-terminal 

10 hcxahistidine-tagged fusion protein and purified to homogeneity by affinity chromatography. 
The majority of the 6xHis-tagged a subunit was located in the soluble fraction and was further 
purified using a Ni 2 "-NTA column. Elution of the a subunit polypeptide was performed using 
0.5 M imidazole, resulting in >90% recovery of the subunits with >95% purity as estimated 
by SDS-PAGE (Figure 5). 

15 Core enzyme was reconstituted by combining crude preparations of the P and 

P % subunits with a polypeptide that contained a histidine tag. followed by removal of 
denaturant by dialysis to facilitate refolding. Holoenzyme was reconstituted by combining the 
core polymerase with purified primary a (MysA) from M. tuberculosis. The core/ holoenzyme, 
containing a 6xHis tag on the a subunit, was purified from free subunits and subassemblies 

20 by batch elution from Ni 2 ~-NTA followed by elution from ion exchange chromatography 
(Figure 6). 

To demonstrate that reconstituted M. tuberculosis RNA polymerase possesses 
transcriptional activity, in vitro transcription reactions were performed using MysA-containing 
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recombinant holoenzyme and templates containing two known MysA-dependem promoters 
(T101 and T125. which correspond to strong and weak mycobacterial promoters, respectively). 
The transcriptional activity of the reconstituted holoenzyme was more than two-fold higher 
when T101 was used as template as compared with T125 (Figure 7). suggesting that the rate 
5 of transcription was related to both the strength of the promoter and to the particular a factor 
used. 

These experiments demonstrated that M. tuberculosis a subunit produced 
according to the invention can be purified and reconstituted into enzymatically active RNA 
polymerase holoenzyme that exhibits specificity for M. tuberculosis promoters. 

10 

Example 3: High Throughput Screens For Inhibitors of M Tuberculosis RNA 
Polymerase and or Subunit 

High-throughput screens for anti-tuberculosis agents may be performed using 
15 either an in vitro or in vivo format. In either case, the ability of test compounds to inhibit M. 
tuberculosis RNA polymerase-driven transcription of M. tuberculosis promoters is tested. 
Typically, a strong promoter such as T101 is used for the test reactions. The sequence of the 
T101 promoter is: 

20 TlOl^ 

AGCTTGCAGATCTAGCGATCGCAGCCGACGTGATACCTGACCGTTGTTGATA 
GTGTCGGCGGCT-3 ' 

a) In vitro screens: 
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The following procedure is used for cell-free high-throughput screening. A 
Tomtec Quadra 96-well pipetting station is used to add the reaction components to 
polypropylene 96-well dishes. 5 \i\ aliquots of test compounds dissolved in DMSO (or DMSO 
alone as a control) are added to wells. This is followed by 20 \i\ of the RNA polymerase 
5 mixture, which consists of: 10 mM DTT, 200 mM KC1, 10 mM Mg~ 2 , 1.5 bovine serum 
albumin, and 0.25 \xg reconstituted RNA polymerase. After allowing the test compound to 
interact with the RNA polymerase, 25 jul of the DNA/NTP mixture is added, containing: 1 fig 
template DNA (see above), 4 |iM [ot- 32 P]-UTP, and 400 jaM each CTP, ATP. and GTP. 

After incubation for 30 min at 25°C, the reaction is stopped by addition of 150 
10 10% trichloroacetic acid (TCA). After incubation at room temperature for 60 min, the 
TCA-precipitated RNA is adsorbed onto double-thick glass fiber filtermats using a Tomtec cell 
harvester. The wells of the microtiter plate and the filter are washed twice with 5% TCA and 
bound radioactivity is determined usinjg a Wallac microbeta 1450 scintillation counter. 

Inhibitory activity due to the test compound is calculated according to the 

15 formula: 

(cpm positive control " Cpm samplc ) 

% inhibition = _ x 100 

e P m posiuvc control 

20 where cpm posiiive comro) represents the average of the cpm in wells that received DMSO alone, 
and cpm samplc represents the cpm in the well that received test compound. Compounds that 
cause at least 50% inhibition are scored as positive "hits" in this assay. 

As an additional control, rifampicin is used at a concentration of 30 nM, which 
results in a 50-75% inhibition of transcription in this assay. 
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b) In vivo screen: 

M. tuberculosis RNA polymerase subunits (a, p, p\ and a particular a subunit) 
are expressed in E. coli under the control of regulatable promoters by transforming E. coli 
with appropriate piasmids. If the a A subunit is expressed, a DNA sequence comprising the 
5 T101 promoter is also introduced into the cells to serve as a template for M. tuberculosis- 
specific transcription. 

In one embodiment, the T101 promoter sequence is linked to a DNA sequence 
encoding the xylE gene product, catechol 2, 3-dioxygenase (CDO). When expressed in the 
E. coli cell, CDO converts catechol to 2-hydroxymuconic semialdehyde, which has a bright 
10 yellow color (having an absorbance maximum at 375 nm) that is easily detected in whole cells 
or in crude extracts. The substrate for this enzyme is a small aromatic molecule that easily 
enters the bacterial cytoplasm and does not adversely affect cell viability. 

In a high-throughput format, aliquots of bacterial cultures are incubated in the 
absence or presence of test compounds, and CDO activity is monitored by measuring 
15 absorbance at 375 nm following addition of catechol. 

c) Specificity: 

Compounds that score as positive in either the in vitro or in vivo assays 
described above are then tested for their effect on human RNA polymerase II. Those 
20 compounds which do not significantly inhibit human RNA polymerase II will be further 
developed as potential anti-tuberculosis agents. 
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Claims : 

1. An isolated, purified DNA encoding M. tuberculosis RNA polymerase a subunit. 

5 2. A DNA as defined in claim 1. wherein said DNA has a sequence selected from the 
group consisting of the sequence shown in Figure 3 ? sequence-conservative variants thereof 
and function-conservative variants thereof. 

3. A DNA vector comprising the nucleic acid sequence of claim 2 operably linked to a 
10 transcription regulatory element. 

4. A cell comprising a DNA vector as defined in claim 3, wherein said cell is selected 
from the group consisting of bacterial fungal, plant, insect, and mammalian cells. 

5. A cell as defined in claim 4, wherein said cell is a bacterial cell. 

6. An isolated purified polypeptide comprising a polypeptide encoded by a DNA as 
defined in claim 2. 

7. A method for high-throughput screening to detect inhibitors of M tuberculosis RNA 
polymerase, said method comprising: 

a) providing a mixture comprising 

(i) purified M tuberculosis RNA polymerase, 
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(ii) a DNA template encoding a promoter sequence that is recognized 
by M. tuberculosis RNA polymerase; 

b) incubating said mixture in the presence of test compounds to form test 
samples, and in the absence of test compounds to form control samples, wherein said 
incubating is performed under conditions that result in RNA synthesis in the control samples; 

c) measuring RNA synthesis directed by said M. tuberculosis-recognized 
promoter in said test and control samples; and 

d) comparing the RNA synthesis detected in step (c) between said test and 
control samples; 

wherein an RNA polymerase inhibitor is a test compound that causes a 
reduction in RNA synthesis measured in said test sample relative to RNA synthesis measured 
in said control sample. 

8. A method as defined in claim 7, therein said purified M tuberculosis RNA 
polymerase comprises recombinant subunits which are reconstituted to form an enzymatically 
active holoenzyme. 

9. A method for high-throughput screening to detect inhibitors of M tuberculosis RNA 
polymerase, said method comprising: 

a) providing a non-mycobacterial bacterial strain that 

(i) has been transformed with a DNA template encoding a 

promoter sequence that is recognized by M. tuberculosis RNA polymerase, and 
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(ii) expresses enzymatically active M. tuberculosis RNA 

polymerase, wherein said polymerase comprises a. p/p\ and one of several a subunits; 

b) incubating the bacterial strain of (a) in the presence of test compounds 
to form test samples, and in the absence of test compounds to form control samples; 

c) measuring RNA synthesis directed by said M tuberculosis-recognized 
promoter in the test and control samples; and 

d) comparing the RNA synthesis detected in step (c) between the test and 

control samples; 

wherein an RNA polymerase inhibitor is a test compound that causes a 
reduction in RNA synthesis measured in said test sample relative to RNA synthesis measured 
in said control sample. 
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1/1 31/11 

CGA GTA CCC CCA CCT TCG CGG CCG CCG CCC CCG ACT CCC CCC ACA GAC GTC ATA TGG CCC 

rVpppsg apppsaptdvxwr 

61/21 RBS 91/31 

ACG Tf fi AAA EGA A GA AGA AAC ACC ATG CTG ATC TCA_CAG CCC CCC ACC CTG TCC GAG GAC 
T5KGRRNTMLI S Q R P T LSED 
121/41 151/51 

GTC CTC ACC GAC AAC CGA TCC CAG TTC GTG ATC GAA CCG CTG GAG CCG GGA TTC GGC TAC 
VLTDNRS O FVI E PL E PCFGY 
1B1/61 211/71 

ACC CTG GGC AAT TCG CTG CGT CGC ACC CTG CTG TCG TCC ATT CCC GGA GCG CCC GTC ACC 
TLGNSLRRTLLSS IPCAAVT 

241/81 271/91 

AGC ATT CGC ATC GAT GGT GTA CTG CAC GAA TTC ACC ACG GTG CCC GGG GTC AAA GAA GAT . 
SIRJDGVLHEFTTVPGVKED 

301/101 331/111 

GTC ACC GAG ATC ATC CTG AAT CTC AAG AGC CTG GTG GTG TCC TCG GAG GAG GAC GAG CCG 
VTEI ILNLK SLVVSSEEDEP 
361/121 391/131 

GTC ACC ATG TAC CTA CGC AAG CAG GGT CCG CGT GAG GTT ACC CCC GGC GAC ATC GTG CCG 
VTHYLRKQCPGEVTAGDIVP 

421/141 451/151 

CCG GCC GGC GTC ACC GTG CAC AAC CCC GGC ATG CAC ATC GCC ACG CTG AAC GAT AAG GGC 
PAGVTVHNP .GMHIATLNDK G 

481/161 511/171 

AAG CTG GAA GTC GAG CTC GTC GTC GAG CGT GGC CGC GGC TAT CTC CCG GCG GTG CAA AAC 
KL EVE LVVERGRG YVPAVQN 

541/181 571/191 

CGG GCT TCG GGT GCC GAA ATT GGG CGC ATT CCA GTC GAT TCC ATC TAC TCA CCG GTG CTC 
RASGAEIGRIPV DSIYSP VL 

601/201 631/211 

AAA GTG ACC TAC AAG GTG GAC GCC ACC CGG GTC GAG CAG CGC ACC GAC TTC GAC AAG CTG 
KVTYKVDAT RVEQRTDFDKL 

661/221 691/231 

ATC CTG GAC GTG GAG ACC AAG AAT TCA ATC AGC CCG CGC GAC CCG CTC GCG TCG GCT GGC 
ILDVETKNSISPRDALASAG 

721/241 7S1/251 

AAG ACG CTG GTC GAG TTG TTC GGC CTG GCA CGG GAA CTC AAC GTC GAG GCC GAA GGC ATC 
KTLV ELFGLAREI-NVEAEGI 

781/261 811/271 

GAG ATC GCG CCG TCG CCC GCC CAG GCC GAT CAC ATT GCG TCA TTC GCC CTG CCG ATC GAC 
E I C P S PA EA DH 1 AS F A L P ID 
841/281 871/291 

GAC CTC GAT CTG ACG GTG CGG TCC TAC AAC TCC CTC AAG CGC GAG GGG GTG CAC ACC GTG 
DLDLTVRSYNCLKR EGVHTV 

901/301 931/311 

GGC GAA CTG GTG GCG CGC ACC GAA TCC GAC CTG CTT GAC ATC CGC AAC TTC GGT CAG AAG 
GELVARTESDLLDIRNFGQK 

961/321 991/331 

TCC ATC GAC GAG GTG AAG ATC AAG CTG CAC CAG CTG GGC CTG TCA CTC AAG GAC AGC CCG 
SIDEVKIKLHQLCLSLKDSP 

1021/341 1051/351 

CCG AGC TTC GAC CCC TCG GAG GTC GCG GGC TAC CAC GTC GCC ACC GGC ACC TGG TCG ACC 
PSFDPSEVAGY DVAT GTWST 

1081/361 1111/371 

GAG GGC GCG TAC GAC GAG CAG GAC TAC CCC GAA ACC GAA CAC CTT TAG ACT CCC TCT AAT 
EGAYDEQDYAET EQ L * TASN 
1141/381 

CCA GAC ACG AGC GTC AGC TAT GCC CAA GCC 
PDRSVSYAQA 
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M. TUBERCULOSIS RNA POLYMERASE ALPHA SUBUNIT 

20 Field of the Invention 

The present invention relates to novel nucleic acids 
encoding RNA polymerase alpha subunit from M. tuberculosis and 
methods for use thereof. 

25 Background of the Invention 

The intracellular pathogen Mycobacterium tuberculosis 
is the causative agent of tuberculosis in humans and is 
responsible for millions of deaths worldwide each year (Bloom 
et al. Science 257:1055, 1992). The emergence of multidrug 

30 resistant forms of tuberculosis has mandated the development 
of. new antibiotics effective against refractory M. tuberculosis 
strains. Novel drugs which, like rifampin (Vall-Spinosa et 
al., N. Eng. J. Med. 283: 616, 1970), may be capable of 
inhibiting the prokaryotic transciptional machinery, could 

3 5 contribute significantly to the development of therapies to 
combat M. tuberculosis . 

The RNA polymerase of eubacteria consists of four 
subunits (or, 0, £' , and a) and exists in two major forms: core 
enzyme (a 2 , 0, 0' ) and holoenzyme (o> 2 , P, &' plus one of 
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several o subunits) ( Chamber lin, in RNA Polymerase, Losick et 
al., eds., Cold Spring Harbor, 1976, pp. 17-67) . The a subunit 
contains determinants for protein-protein interactions with 
transcription activators and protein-DNA interaction with 
5 upstream elements (Ishihama, Mol .Microbiol . 6: 3283, 1992; Russo 
et al., J. Biol.Chem. 267:14515, 1992; Ebright et al., 
Curr.Opin. Genet. Dev. 5:197, 1995) . The amino terminal domain 
of a is also required for assembly of the multisubunit core RNA 
polymerase (Ishihama, Adv. Biophys. 14:1, 1981) . The /S subunit 
10 is essential for transcript initiation and elongation, which 
(3' apparently functions in binding of the core enzyme to 
template DNA (Yura et al . , Ann. Rev. Genet. 13:59, 1979). Core 
RNA polymerase is capable of RNA synthesis; however, o is 
required for specific initiation of transcripts at promoter 
15 sequences (Gross et al . , in Transcriptional Regulation, Cold 
Spring Harbor, 1992, pp. 129-176) . 

Discovery of inhibitors of M. tuberculosis RNA 
polymerase- is hampered by a lack of information concerning 
components of the M. tuberculosis transcriptional apparatus, 
20 difficulties in obtaining sufficient yields of active enzymes 
for biochemical studies, and biosafety concerns. Establishment 
of an in vitro transcription system employing purified and 
reconstituted RNA polymerase would greatly advance efforts to 
identify new therapeutic agents active against tuberculosis. 
25 Accordingly, there is a need in the art for 

compositions and methods utilizing cloned genes and purified 
proteins derived from M. tuberculosis RNA polymerase. 

Summary of the Invention 

3 0 The present invention is based on the isolation and 

characterization of DNA encoding the a subunit of RNA 
polymerase derived from M. tuberculosis . In one aspect, the 
invention provides a purified, isolated nucleic acid having the 
sequence shown in Figure 3 . The invention also encompasses 

35 sequence -conservative and function- conservative variants of 
this sequence. The invention also provides vectors comprising 
these sequences, and cells comprising the vectors. 
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In another aspect, the present invention provides a 
purified, isolated polypeptide encoded by the nucleic acid 
sequence shown in Figure 3, as well as function-conservative 
variants thereof. In one embodiment, the invention provides 
5 a purified a subunit further comprising a hexahistidine tag. 
The invention also provides purified, reconstituted core- and 
holoenzyme comprising the a subunit. 

In yet another aspect, the invention provides in 
vitro methods for high- throughput screening to detect 
10 inhibitors of M. tuberculosis RNA polymerase. In one 
embodiment, the methods comprise: 

a) providing a mixture comprising 

(i) purified M. tuberculosis RNA polymerase and * 

(ii) a DNA template encoding a promoter 
15 sequence that is recognized by M. tuberculosis RNA polymerase; 

b) incubating the mixture in the presence of test 
compounds to form test samples, and in the absence of test 
compounds to form control samples, under conditions that result 
in RNA synthesis in the control samples; 

20 c) measuring RNA synthesis directed by said M. 

tuberculosis-recognized promoter in the test and control 
samples; and 

d) comparing the RNA synthesis detected in step (c) 
between the test and control samples. According to the 

25 invention, an RNA polymerase inhibitor is a test compound that 
causes a reduction in RNA synthesis directed by the M. 
tuberculosis- recognized promoter measured in the test sample 
relative to RNA synthesis measured in the control sample. In 
a preferred embodiment, the purified RNA polymerase used in 

3 0 practicing this method comprises recombinant subunits which are 
reconstituted to form an enzymatically active holoenzyme. In 
another embodiment, the ability of test compounds to bind to 
purified a subunit is monitored. 

In yet another aspect, the invention provides in vivo 
35 methods for high- throughput screening to detect inhibitors of 
M. tuberculosis RNA polymerase. The methods are carried out 
by the steps of: 
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a) providing a non-mycobacterial bacterial strain, 
preferably E. coli, that 

(i) has been transformed with a DNA template 
encoding a promoter sequence that is recognized by M. 

5 tuberculosis RNA polymerase, and 

(ii) expresses enzymat ically active M. 
tuberculosis RNA polymerase (e.g., ot 2 , /?' plus one of 
several a subunits) ; 

b) incubating the bacterial strain of (a) in the 
10 presence of test compounds to form test samples, and in the 

absence of test compounds to form control samples; 

c) measuring RNA synthesis directed by said AT. 
tuberculosis-recognized promoter in the test and control 
samples ; and 

15 d) comparing the RNA synthesis directed by said Ml 

tuberculosis-recognized promoter detected in step (c) between 
the test and control samples. According to the invention, an 
RNA polymerase inhibitor is a test compound that causes a 
reduction in RNA synthesis measured in the test sample relative 

20 to RNA synthesis measured in the control sample. 

These and other aspects of the present invention will 
be apparent to those of ordinary skill in the art in light of 
the present specification and appended claims. 

25 Brief Description of the Drawings 

Figure 1 is a graphic illustration comparing the 
deduced amino acid sequence of the M. tuberculosis rpoA gene 
fragment and corresponding regions of a subunits of B. subtil is 
and E. coli. Mt, M. tuberculosis; Bs, B. subtilis; Ec, E . coli. 

30 Dashed lines denote conserved segments of B . subtilis and E . 
coli proteins for which oligonucleotides were designed. 

Figure 2A is a photographic illustration of "a 
Southern blot analysis of rpoA hybridizing cosmid clones. Four 
different positive clones (designated IIA5, IIA3, ID9,.and IC5) 

35 isolated from an M. tuberculosis cosmid library were digested 
with NotI and hybridized with an rpoA gene probe. Sizes of DNA 
markers (X EcoT141, Amersham) are indicated in kb. 
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Figure 2B is a photographic illustration of a 
Southern blot analysis of rpoA hybridizing cosmid clones, 
showing structural conservation of genomic and cosmid-borne 
rpoA sequences. Purified cosmid IC5 DNA (lanes 1-5) or M. 
5 tuberculosis H37Rv genomic DNA (lanes 7-11) were digested with 
the indicated restriction enzymes and hybridized with an rpoA 
probe. Sizes of DNA markers (X EcoT141, Amersham) are 
indicated in kb. 

Figure 3 shows the nucleotide and deduced amino acid 
10 sequences of the M. tuberculosis H3 7Rv rpoA gene. RBS denotes 
a potential ribosome binding site. The putative start and stop 
codons of the rpoA gene product are at 85-87 and 1126-1128, 
respectively. 

Figure 4 is a graphic illustration comparing the 
15 inferred amino acid sequence of the M. tuberculosis H3 7Rv ci 
subunit with sequences of a subunits from other bacteria. B. 
sujbtilis (Bs) , M. genitalium (Mg) , E. coli (Ec) , H. influenza 
(Hi) , B. pertusis (Bp) , S. typhimurium (St) , and C. trachomitis 
(Ct) . Shading indicates identical amino acid residues. Amino 
20 acid sequence alignments were performed using MegAlign 
(DNAStar) . 

Figure 5 is a photographic illustration of an SDS- 
polyacrylamide gel electropherogram depicting the expression 
and purification of M. tuberculosis a subunit in E. coli. Lane 
25 1 shows molecular mass markers (kDa) ; lanes 2 and 3 show total 
proteins from uninduced and IPTG- induced cultures, 
respectively, of E. coli harboring pJH37; and lane 4 shows 
purified Af. tuberculosis a subunit (approximately 38 kDa), which is indicated 
by an arrow. 

3 0 Figure 6 is a photographic illustration of an SDS-polyacrylamide gel 

electropherogram showing the reconstitution and purification of core- and holo- RNA 
polymerase from A/, tuberculosis. Lanes 1 and 6 show molecular mass markers (kDa); 
lane 2 shows a-histidine-containing core enzyme; lane 3 shows a histidine-containing 
holoenzyme; lane 4 shows N-terminal histidine-tagged primary sigma; and lane 5 shows 

3 5 purified £. coli holoenzyme* 
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Figure 7 is a graphic illustration of in vitro transcription reactions employing 
reconstituted purified M. tuberculosis RNA polymerase holoenzyme and T101 and T125 
promoter-containing DNA templates. 

5 Detailed Description of the Invention 

All patent applications, patents and literature references cited herein are 
hereby incorporated by reference in their entirety. In the case of inconsistencies the 
present disclosure will prevail. 

The present invention is based on the isolation of a fragment of the M. 
1 0 tuberculosis rpoA gene, encoding the a subunit of RNA polymerase. As described in 
Example 1 below, the fragment was isolated by the polymerase chain reaction (PCR) using 
primers directed against regions of a that are highly conserved among Gram-positive and 
Gram-negative bacteria. The amplified DNA was utilized as a hybridization probe to 
recover the entire rpoA gene from a cosmid library of genomic DNA from virulent M. 
15 tuberculosis strain H37RV. Nucleotide sequencing indicated that the 1044 bp M 
tuberculosis rpoA open reading frame (ORF) encodes a protein of 347 amino acids which 
shows significant structural similarity to the a subunits of diverse bacterial species with 
greatest identity to the B. subtilis a protein (Figure 1). 

Though structurally conserved throughout most of its length, the inferred 
2 0 amino acid sequence of the M. tuberculosis a subunit diverged from that of B. subtilis and 
other bacteria at its extreme C-terminus. Interestingly, studies on E. coli RNA polymerase 
subunits have indicated that the C-terminal 94 residues of a function in interactions with 
certain transcriptional regulatory proteins, while the N-terminal domain may play a more 
highly conserved role in facilitating the core assembly (Ishihama, MolMicrobiol 6:3283, 
25 1992; Russo et aL, J.BioLChem. 267:14515, 1992). 

The B. subtilis rpoA gene is part of a large operon that is cotranscribed with 
genes for initiation factor 1 and ribosomal proteins B, S13, Sll, and LI 7; and, unlike the 
pattern of gene organization seen in E. colU the B. subtilis rpoA gene cluster is genetically 
linked to the rpoB-C operon encoding RNA polymerase components p and P' (Boylan et 
30 aL, J. Bacterioi 171:2553, 1992). 

In practicing the present invention, many techniques in molecular biology, 
microbiology, recombinant DNA, and protein biochemistry such as these explained fully 
in, for example, Sambrook et a/., 1989, Molecular Cloning: A Laboratory Manual, Second 
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Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York; DNA 
Cloning: A Practical Approach, Volumes I and II. 1985 (D.N. Glover ed.); 
Oligonucleotide Synthesis, 1984, (M.L. Gait ed.); Transcription and Translation, 1984 
(Hames and Higgins eds.); A Practical Guide to Molecular Cloning; the series, Methods 
5 in Enzymology (Academic Press, Inc.); and Protein Purification: Principles and Practice, 
Second Edition ( Springer- Verlag, N.Y.), may be used. 

The present invention encompasses nucleic acid sequences encoding the a 
subunit of M. tuberculosis RN A polymerase, enzymatically active fragments derived 
therefrom, and related sequences. As used herein, a nucleic acid that is "derived from" a 

10 sequence refers to a nucleic acid sequence that corresponds to a region of the sequence, 
sequences that are homologous or complementary to the sequence, and "sequence- 
conservative variants" and "function-conservative variants". Sequence-conservative variants 
are those in which a change of one or more nucleotides in a given codon position results 
in no alteration in the amino acid encoded at that position. Function-conservative variants 

15 are those in which a given amino acid residue in the a subunit has been changed without 
altering the overall conformation and function of the polypeptide, including, but not limited 
to, replacement of an amino acid with one having similar physico-chemical properties (such 
as, for example, acidic, basic, hydrophobic, and the like). Fragments of the a subunit that 
retain enzymatic activity can be identified according to the methods described herein, such 

2 0 as e.g., expression in £. coli followed by enzymatic assay of the cell extract or by 

purification and reconstitution of the purified polypeptide into an enzymatically active 
holoenzyme. 

The nucleic acids of the present invention include purine- and pyrimidine- 
containing polymers of any length, either polyribonucleotides or polydeoxyribonucleotides 
25 or mixed polyribo-polydeoxyribo nucleotides. This includes single- and double-stranded 
molecules, i.e., DNA-DNA, DNA-RNA and RNA-RNA hybrids, as well as "protein nucleic 
acids" (PNA) formed by conjugating bases to an amino acid backbone. This also includes 
nucleic acids containing modified bases. The nucleic acids may be isolated directly from 
cells. Alternatively, PCR can be used to produce the nucleic acids of the invention, vising 

3 0 either chemically synthesized strands or genomic material as templates. Primers used for 

PCR can be synthesized using the sequence information provided herein and can further 
be designed to introduce appropriate new restriction sites, if desirable, to facilitate 
incorporation into a given vector for recombinant expression. 
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The nucleic acids of the present invention may be flanked by natural M. 
tuberculosis regulatory sequences, or may be associated with heterologous sequences, 
including promoters, enhancers, response elements, signal sequences, polyadenylation 
sequences, introns, 5'- and 3'- noncoding regions, and the like. The nucleic acids may also 
5 be modified by many means known in the art. Non-limiting examples of such 
modifications include methylation, "caps", substitution of one or more of the naturally 
occurring nucleotides with an analog, and internucleotide modifications such as, for 
example, those with uncharged linkages (e.g., methyl phosphonates, phosphotriesters, 
phosphoroamidates, carbamates, etc.) and with charged linkages (e.g., phosphorothioates, 
10 phosphorodithioates, etc.). Nucleic acids may contain one or more additional covalently 
linked moieties, such as, for example, proteins (e.g., nucleases, toxins, antibodies, signal 
peptides, poly-L-lysine, etc.), intercalators (e.g., acridine, psoralen, etc.), chelators (e.g., 
metals, radioactive metals, iron, oxidative metals, etc.), and alkylators. The nucleic acid 
may be derivatized by formation of a methyl or ethyl phosphotriester or an alkyl 
1 5 phosphoramidate linkage. Furthermore, the nucleic acid sequences of the present invention 
may also be modified with a label capable of providing a detectable signal, either directly 
or indirectly. Exemplary labels include radioisotopes, fluorescent molecules, biotin, and 
the like. 

The invention also provides nucleic acid vectors comprising the disclosed 
20 a subunit sequences or derivatives or fragments thereof. A large number of vectors, 
including plasmid and fungal vectors, have been described for replication and/or expression 
in a variety of eukary otic' and prokaryotic hosts. Non-limiting examples include pKK 
plasmids (Clontech), pUC piasmids, pET plasmids (Novagen, Inc., Madison, WI), or 
pRSET or pREP (Invitrogen, San Diego, CA), and many appropriate host cells, using 
25 methods disclosed or cited herein or otherwise known to those skilled in the relevant art. 
Recombinant cloning vectors will often include one or more replication systems for cloning 
or expression, one or more markers for selection in the host, e.g. antibiotic resistance, and 
one or more expression cassettes. Suitable host cells may be 
transformed/transfected/infected as appropriate by any suitable method including 
3 0 electroporation, CaCl 2 mediated DNA uptake, fungal infection, microinjection, 
microprojectile, or other established methods. 

Appropriate host cells include bacteria, archebacteria, fungi, especially yeast, 
and plant and animal cells, especially mammalian ceils. Of particular interest are E, colU 
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B. sub til is, Saccharomyces cerevisiae, Saccharomyces carlsbergensis, Schizosaccharomyces 
pombl SF9 ceils, CI 29 cells, 293 cells, Neurospora, and CHO cells, COS cells, HeLa 
cells, and immortalized mammalian myeloid and lymphoid cell lines. Preferred replication 
systems include M13, ColEl, SV40, baculovirus, lambda, adenovirus, and the like. A 
5 large number of transcription initiation and termination regulatory regions have been 
isolated and shown to be effective in the transcription and translation of heterologous 
proteins in the various hosts. Examples of these regions, methods of isolation, manner of 
manipulation, etc. are known in the art. Under appropriate expression conditions, host cells 
can be used as a source of recombinantly produced Mycobacterial-derived peptides and 
1 0 polypeptides. 

Advantageously, vectors may also include a transcription regulatory element 
(i.e., a promoter) operably linked to the a subunit portion. The promoter may optionally 
contain operator portions and/or ribosome binding sites. Non-limiting examples of 
bacterial promoters compatible with E. coli include: trc promoter, (^lactamase 

15 (penicillinase) promoter; lactose promoter; tryptophan (trp) promoter; arabinose BAD 
operon promoter; lambda-derived PI promoter and N gene ribosome binding site; and the 
hybrid tac promoter derived from sequences of the trp and lac UV5 promoters. Non- 
limiting examples of yeast promoters include 3-phosphoglycerate kinase promoter, 
glyceraldehyde-3-phosphate dehydrogenase (GAPDH) promoter, galactokinase (GALI) 

20 promoter, galactoepimerase promoter, and alcohol dehydrogenase (ADH) promoter. 
Suitable promoters for mammalian cells include without limitation viral promoters such as 
that from Simian Virus 40 (SV40), Rous sarcoma virus (RSV), adenovirus (ADV), and 
bovine papilloma virus (BPV). Mammalian cells may also require terminator sequences 
and poly A addition sequences, and enhancer sequences which increase expression may 

25 also be included. Sequences which cause amplification of the gene may also be desirable. 
Furthermore, sequences that facilitate secretion of the recombinant product from cells, 
including, but not limited to, bacteria, yeast, and animal cells, such as secretory signal 
sequences and/or prohormone pro region sequences, may also be included. 

Nucleic acids encoding wild-type or variant a subunit polypeptides may also 

3 0 be introduced into cells by recombination events. For example, such a sequence can be 
introduced into a cell, and thereby effect homologous recombination at the site of an 
endogenous gene or a sequence with substantial identity to the gene. Other recombination- 
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based methods, such as non-homologous recombinations or deletion of endogenous genes 
by homologous recombination, may also be used. 

a subunit-derived polypeptides according to the present invention, including 
function-conservative variants, may be isolated from wild-type or mutant M. tuberculosis 
5 cells, or from heterologous organisms or cells (including, but not limited to. bacteria, fungi, 
insect, plant, and mammalian cells) into which an a subunit-derived protein-coding 
sequence has been introduced and expressed. Furthermore, the polypeptides may be part 
of recombinant fusion proteins (see, e.g., Example 2 below). Alternatively, polypeptides 
may be chemically synthesized by commercially available automated procedures, including, 
1 0 without limitation, exclusive solid phase synthesis, partial solid phase methods, fragment 
condensation or classical solution synthesis. 

"Purification" of an a subunit polypeptide refers to the isolation of the 
polypeptide in a form that allows its enzymatic activity to be measured without interference 
by other components of the cell in which the polypeptide is expressed. Methods for 
1 5 polypeptide purification are well-known in the art, including, without limitation, preparative 
disc-gel electrophoresis, isoelectric focusing, HPLC, reversed-phase HPLC, gel filtration, 
ion exchange and partition chromatography, and countercurrent distribution. For some 
purposes, it is preferable to produce the polypeptide in a recombinant system in which the 
protein contains an additional sequence tag that facilitates purification, such as, but not 
2 0 limited to, a polyhistidine sequence. The polypeptide can then be purified from a crude 
lysate of the host cell by chromatography on an appropriate solid-phase matrix. 
Alternatively, antibodies produced against the a subunit or against peptides derived 
therefrom can be used as purification reagents. Other purification methods are possible. 

The isolated polypeptides may be modified by, for example, 
2 5 phosphorylation, sulfation, acy lation, or other protein modifications. They may also be 
modified with a label capable of providing a detectable signal, either directly or indirectly, 
including, but not limited to, radioisotopes and fluorescent compounds. 

Screening Methods to Identify Anti-tuberculosis Agents 
30 The methods and compositions of the present invention can be used to 

identify compounds that inhibit the function of M. tuberculosis RNA polymerase and thus 
are useful as anti-tuberculosis agents. This is achieved by providing enzymatically active 
recombinant a subunit according to the present invention, in combination with other 
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components of RNA polymerase, in a context in which the inhibitory effects of test 
compounds can be measured. Alternatively, the ability of test compounds to bind to 
purified a subunit is monitored, using methods well-known in the art. Test compounds 
identified using the methods of the invention encompass those that interfere with the 
5 enzymatic activity of RNA polymerase and/or disrupt the interaction of the a subunit with 
transcriptional activators. Thus, the compounds may function as general or gene-specific 
transcriptional inhibitors. 

In a preferred embodiment, recombinant M. tuberculosis RNA polymerase 
subunits (a, p, P' plus one of several a subunits) are purified in milligram quantities from 
10 E. coli cultures by affinity methods utilizing a hexahistidine tagged a subunit. 

Enzymatically active holoenzyme is reconstituted using these components (see, e.g., 
Example 2 below). The active polymerase is then incubated in the presence of test 
compounds to form test mixtures, and in the absence of test compounds to form control 
mixtures. In vitro transcription is then carried out using a DNA template containing 
15 appropriate promoter sequences (see, e.g., Example 2 below). 

In another embodiment, M tuberculosis RNA polymerase subunits (ct 2 , p, 
P* plus one of several a subunits) are co-expressed in E. coli or another surrogate bacterial 
cell, in conjunction with an appropriate promoter functionally linked to a reporter gene. 
The ability of test compounds to differentially inhibit M. tuberculosis RNA polymerase is 
2 0 then assessed by measuring the amount and/or activity of the reporter gene product. 

M. tuberculosis promoters useful in practicing the invention include without 
limitation: PI or P2 hsp 60 promoters (Stover et al., Nature 351:456, 1991); cpn-60 
promoter (Kong et al., Proc.NatlAcadSci.USA 90:2608, 1993); 85A antigen promoter 
(Kremer, J. Bacteriol \Tk642 y 1995); PAN promoter (Murray et al., Mol Microbiol 
25 6:3331, 1992); 16S RNA promoter (Ji et al., Microbiol 140: 2829, 1994); and askp 
promoter (Cirillo et al., Mol Microbiol H:629, 1994). Useful reporter genes include 
without limitation xylE (Curcic et al., Mol Microbiol 13:1057, 1994); CAT (Das Gupta 
et al., J. Bacteriol 175 :5 1 86, 1 993); luciferase (Cooksey et al., Antimicrob.Agts. Chemother. 
37:1348, 1993); green fluorescent protein (Dhadayuthap et al., Mol Microbiol 17:901, 
30 1995); and lacZ (Silhavy et al., Microbiol Rev. 49:398, 1985). 

It will be understood that the present invention encompasses M. tuberculosis 
RNA polymerases containing any appropriate a factor, which is used in conjunction with 
a particular promoter that is recognized by RNA polymerase containing that a factor. 
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Non-limiting examples of useful a factors include a*, a B , and cr 5 (Doushkan et al., 1995; 
DeMaio et al., Gene 165:67, 1995). The invention also encompasses the identification of 
additional promoters that are recognized by a particular a subunit. This is achieved by 
providing a library of random M. tuberculosis gene fragments cloned upstream of an 
5 appropriate reporter gene (see above). The library is transformed into M tuberculosis or 
M smegmatis and reporter gene expression is measured. Alternatively, the library is 
transformed into another bacterial cell, such as, e.g., £. coli, which expresses M. 
tuberculosis RNA polymerase core subunits as well as a a subunit of interest and cognate 
promoters that drive reporter gene expression. In yet another embodiment, expression of 
10 an M tuberculosis or factor confers new recognition properties on E. coli RNA polymerase 
and permits isolation of promoters utilized specifically by a particular M tuberculosis a 
subunit. 

Preferably, both in vitro and in vivo screening methods to detect M: 
tuberculosis-specific RNA polymerase inhibitors are adapted to a high-throughput format, 
15 allowing a multiplicity of compounds to be tested in a single assay. Such inhibitory 
compounds may be found in, for example, natural product libraries, fermentation libraries 
(encompassing plants and microorganisms), combinatorial libraries, compound files, and 
synthetic compound libraries. For example, synthetic compound libraries are 
commercially available from Maybridge Chemical Co. (Trevillet, Cornwall, UK), 
20 Comgenex (Princeton, NJ), Brandon Associates (Merrimack, NH), and Microsource (New 
Milford, CT). A rare chemical library is available from Aldrich Chemical Company, Inc. 
(Milwaukee, WI). Alternatively, libraries of natural compounds in the form of bacterial, 
fungal, plant and animal extracts are available from, for example, Pan Laboratories 
(Bothell, WA) or MycoSearch (NC), or are readily producible. Additionally, natural and 
25 synthetically produced libraries and compounds are readily modified through conventional 
chemical, physical, and biochemical means (Blondelle et al., TibTech 14.60, 1996). 
preferably using automated equipment, to allow for the simultaneous screening of a 
multiplicity of test compounds. 

Useful anti-tuberculosis compounds are identified as those test compounds 
3 0 that decrease tuberculosis-specific transcription. Once a compound has been identified by 
the methods of the present invention as an RNA polymerase inhibitor, in vivo and in vitro 
tests may be performed to further characterize the nature and mechanism of the inhibitory 
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activity. For example, classical enzyme kinetic plots can be used to distinguish, e.g., 
competitive and non-competitive inhibitors. 

Compounds identified as RNA polymerase inhibitors using the methods of 
the present invention may be modified to enhance potency, efficacy, uptake, stability, and 
5 suitability for use in pharmaceutical formulations, etc. These modifications are achieved 
and tested using methods well-known in the art. 

The present invention is further described in the following examples which 
are intended to further describe the invention without limiting the scope thereof. 

10 Example 1: Isolation and Analysis of the M. tuberculosis rpoA Gene 

The following experiments were performed in order to clone the gene 
encoding M. tuberculosis RNA polymerase a subunit. In these experiments, the following 
materials and methods were used. 
A. Materials and Methods 

15 PCR amplification: M tuberculosis H37Rv genomic DNA (1 ng) was 

amplified in a 100 pi reaction containing: 10 mM Tris-HCl pH 8.3, 1.5 mM MgCl 2 , 500 
mM KC1, 2.5 U Taq polymerase (Beohringer-Mannheim), 0.2 mM dNTPs and 
oligonucleotide primers (1 pM each): 5 ' -CT ACGC AAGC AGGGTCCGGGTG AG-3 ' and 
5'-CTCACCCGGACCCTGCTTGCGTAG-3\ PCR was performed in an Omnigene 

2 0 thermocycler with 35 cycles of 94°C for 1 min, 55°C for 2 min, 72°C for 1 min. The 
amplification product was gel-purified and ligated into vector pCR2.1 (Invitrogen). 

Cosmid hybridizations: A transducing lysate of a cosmid library of M. 
tuberculosis H37Rv genomic DNA in vector pYA3060 was used. Cosmid-bearing E. coli 
02819T (Jacobs et al M Infec.Immun. 52:101, 1986) colonies representing roughly five 

2 5 genomic equivalents were individually picked to wells of sterile 96- well microliter dishes 

and propagated at 30°C in Luria broth cohtaining ampicillin at 30 |ag/ml and thymidine at 
50 |ig/ml. Colonies were grown overnight at room temperature on the above media as 
nylon filter replicas of the library. Filters were processed for colony hybridization by 
standard methods and probe hybridizations performed as described below. Cosmid DNAs 

3 0 were purified using maxiprep columns (Qiagen). 

Southern blot analysis: Restriction enzyme digests of M, tuberculosis 
H37Rv chromosomal or cosmid DNAs were resolved by electrophoresis on 1% TAE- 
agarose gels and blotted to nylon filters using standard methods. Probe labeling and 
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hybridizations were performed using the Gene Images hybridization system (Amersham) 
essentially as described by the supplier except that hybridizations and washes were carried 
out at 70°C 

DNA sequencing and analysis: Plasmid templates for nucleotide sequencing 
5 were purified using RPM miniprep kits (Biol 01). PCR cycle sequencing (ABI Prizm) was 
carried out with an Applied Biosystems automated sequencer. 

B. Strategy for M. tuberculosis rpoA gene isolation 

A fragment of the M. tuberculosis rpoA gene was isolated by PCR, using 
10 oligonucleotide primers homologous to defined regions of a that are conserved among 
Gram-positive and Gram-negative bacterial species. Amino acid sequence alignments of 
the a subunits of B. subtilis and E. coli revealed several short regions of identity clustered 
primarily within the N-terminal half of a implicated in association with RNA polymerase 
core subunits. Primers corresponding to conserved a segments LRRILLSS and 
15 VTAADITHD (amino acids 39-46 and 106-114, respectively of the B. subtilis subunit) 
specifically amplified a DNA fragment of anticipated size (228 bp) using M. tuberculosis 
H37Rv genomic DNA as a template. To confirm that the amplified DNA contained rpoA 
sequences, the 228 bp fragment was subcioned and a partial nucleotide sequence 
determined. The deduced amino acid sequence of the PCR product showed significant 
2 0 homology to the a subunits of other bacteria (Figure 1), confirming its identity as an rpoA 
gene fragment. One ORF of the 228 bp fragment displayed high degree of identity to 
corresponding intervals of the B. subtilis (61%) and E. coli (55%) a subunits. The M. 
tuberculosis r/?a4-derived sequence had an overall G+C content of 60% while the G+C 
content for bases occupying the third position of each codon increased to 76%, values in 

2 5 good agreement with compositions of other M. tuberculosis genes (Weiden et ah, in 

Tuberculosis, Little, Brown, 1996, pp. 211-222). 

C. Southern blot a nalysis and isolation of the entire M. tuberculosis rvoA gene 

To establish that the cloned 228 bp fragment was mycobacterial in origin, 

3 0 Southern blot analysis of restriction enzyme digests of M. tuberculosis H37Rv genomic 

DNA was performed. The PCR probe detected unique restriction fragments in 
chromosomal digests (Figure 2B). These data are consistent with a single copy of the rpoA 
gene in the A£ tuberculosis chromosome (two bands of hybridization observed in the 
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EcoRl digest result from the presence of an EcoRl site within the 228 bp probe sequence). 
The full-length rpoA gene was obtained from a cosmid library of M. tuberculosis H37Rv 
genomic fragments using the 228 bp PCR probe. Screening of approximately 600 cosmid- 
bearing E. coli colonies (representing roughly 5 genome equivalents) with the rpoA gene 
5 fra2ment yielded 4 positive clones. Restriction analysis indicated that the 4 isolates 
differed from one another with respect to total insert size, though all shared a roughly 10 
kb Notl fragment which hybridized to the rpoA gene probe (Figure 2A). One rpoA- 
hybridizing cosmid clone, designated IC5, was chosen for further analysis. Southern 
blotting of ICS DNA digested with a panel of restriction enzymes confirmed that the no 
10 gross structural rearrangements of the rpoA gene had occurred during cloning (Figure 2B). 
The 1.4 kb SacII and 1.3 kb Sail rpoA -hybridizing fragments of cosmid ICS were 
subcloned into vector pSKII+ prior to nucleotide sequencing. 

D Sequence analysis of the M. tuberculosis rpoA gene 
15 Nucleotide sequencing was performed on plasmid subclones and on cosmid 

ICS DNA. Nucleotide and deduced amino acid sequences of the rpoA gene of M. 
tuberculosis are shown in Figure 3. The sequence encodes a 1044 bp ORF which has an 
overall G+C composition of 63% (85% for bases occupying the codon third position). 
Assuming that the ATG at position 85-87 serves as the initiator codon, the ORF is 

2 6 expected to encode a protein of 347 amino acids. A strong match with the consensus 
sequence for an M. tuberculosis ribosome binding site (GAAAGGA), (Novick, in 
Tuberculosis, Little, Brown, 1996, pp. 187-198) is positioned just upstream of the putative 
ATG codon. Examination of more than 80 bp of nucleotide sequence upstream of the 
translation start site did not reveal regions of exact identity with prokaryotic promoter sites. 

25 Among a subunits studied in other bacterial species, the deduced amino acid sequence of 
the 347 residue M. tuberculosis protein displayed greatest similarity (48% identity) to the 
a subunii of the Gram-positive sporulating bacterium B. subtilis (Figure 4). 

Example 2: Expression of M. tuberculosis RNA Polymerase « Sub unit and 
3 0 Reconstitution of Active Af. tuberculosis RNA Polymerase - 
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The following experiments were performed to purify M. tuberculosis RNA 
polymerase a subunit that had been expressed in E. coli and to reconstitute enzymatically 
active RNA polymerase holoenzyme. 

5 A. Methods: 

Purification of M. tuberculosis « subunit: Recombinant a was produced in 
E. coli BL21(DE3) cells (Novagen) and grown at 37°C in LB medium containing 25 ng/ml/ 
kanamycin. Overnight cultures (100 ml) were diluted 10- fold and grown in LB medium 
containing 25 ng/ml kanamycin until A^ = 0.6 (approximately 75 min). Expression of 

10 a was induced for 3 hr with 4 mM IPTG. Purification of the a subunit was performd 
essentially as described by Tang, et. al, Meth.EnzymoL 173:130, 1996. Briefly, cell pellets 
were harvested by centrifugation for 10 min At 4°C (3000 x g). The pellet was 
resuspended in 20 ml buffer B (6 M guanidine HC1, 10 mM Tris (pH 7.9), 500 mM NaCl, 
5 mM imidazole) and lysed by sonication. Inclusion bodies were removed by 

15 centrifugation at 42,000 x g for 30 min at 4°C. The sample was then adsorbed onto Ni 2+ - 
NTA (Qiagen) in buffer B. The sample was washed twice with 20 ml buffer B; washed 
twice with buffer B containing 30 mM imidazole; and eluted with 10 ml buffer B 
containing 500 mM imidazole. Adsorption, washes, and elution were performed with 1 
min incubations at 4°C with gentle mixing. 

20 Reconstitute of a with other M tuberculosis subunits to form core/holo- 

RNA polymerase: Reconstitution of core/holo- enzyme was performed by combining 60, 
300, and 600 ng of a, 0, and p\ respectively, followed slow removal of the denaturant by 
dialysis. The combined protein concentration was adjusted to 0.5 mg/ml with buffer B to 
prevent aggregation (Nobuyuki et al., MethEnzymol 273:121, 1996) and dialysed against 

25 buffer E (50 mM Tris (pH 7.9), 200 mM KC1, 10 mM MgCl 2 , 10 nM ZnCl 2 , 1 mM 
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EDTA, 5 mM p-mercaptoethanol, 20% (v/v) glycerol) overnight at 4°C Following 
dialysis, the sample was activated by incubation for 45 min at 30°C and aggregates cleared 
by centrifugation at 10,000 x g for 10 min at 4°C The sample was adsorbed onto Ni 2+ - 
NTA in buffer F (50 mM Tris (pH 7.9), 0.5 mM EDTA, 5% (v/v) glycerol), and washed 
5 three times in buffer F containing 5 mM imidazole, and eluted in buffer F containing 150 
mM imidazole. Adsorption and elution were performed by incubating for 45 min at 4°C 
and washes for 1 min. Each step was followed by centrifugation at 1 6,000 x g for 2 min 
at 4°C. Core/holo- enzyme was dialysed in buffer F (50 mM Tris (pH 7.9), 50 mM KC1, 
0.5 mM EDTA, 5% (v/v) glycerol) overnight at 4°C 
10 Purification of M. tuberculosis holoenzyme: Core polymerase and 

subassemblies were separated from holoenzyme by elution from a MonoQ column 
(Pharmacia) using a 0.2 - 0.5 M KC1 gradient. Fractions containing holoenzyme were 
pooled and used the in vitro transcription assay. 

Promoter construction: Recently, mycobacterium promoters have been 
15 cloned using a plasmid shuttle vector pSD7 (Das Gupta et al., J. Bacteriol. 175 :5186, 
1993) and in a later study, the strength of these promoters was analyzed (Bashyam et al., 
J. Bacteriol 178:4847. 1996). Based on these studies two promoters, T125 and T101, 
identified as a weak and a strong promoter, respectively, were cloned into the pUC19 
vector and used as templates for M tuberculosis transcription asays. 
2 0 Transcriptional activity: Holoenzyme was reconstituted by the addition of 

primary sigma factor MysA to the core polymerase and incubation at 30°C for 20 minutes. 
The transcription reaction was performed as described (Shorenstein et al., 1 BiolChem. 
248:6170, 1973) except that 3 ng of template (pUC19, pMCl 16, or pMCl 17) was used per 
reaction containing a high salt transcription buffer (50 mM Tris (pH7.9), 10 MgCl 2 , 200 
25 mM KC1, 10 mM DTT, 0.1 mM EDTA, 1 mM K 2 HP0 4 (pH7.5), 100 >ig/ml BSA). 50 pal 
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reactions were incubated for 30 minutes at 37°C. To precipitate the RNA transcripts and 
to stop the reaction, 100 ^1 of 10% TCA were added to the reaction. The TCA- 
precipitated RNA was adsorbed onto UniFilter GF/C (Packard, Meriden, CT) double-thick 
glass fiber filtermats using a cell harvester (Packard, Meriden, CT). The wells of the 
5 microtiter plate and the filter were washed two times with 5% TCA and bound 
radioactivity was determined using a TopCount-HTS (Packard, Meriden, CT) scintillation 
counter. 

B. Results: 

1 0 Recombinant M tuberculosis a subunit was overproduced in soluble form 

in E colt cells transformed with plasmid pJH37, a derivative of pET26b, containing the 
entire rpoA coding region. M. tuberculosis a was expressed at high levels as a C-terminal 
hexahistidine-tagged fusion protein and purified to homogeneity by affinity 
chromatography. The majority of the 6xHis-tagged a subunit was located in the soluble 

15 fraction and was further purified using a Ni 2+ -NTA column. Elution of the a subunit 
polypeptide was performed using 0.5 M imidazole, resulting in >90% recovery of the 
subunits with >95% purity as estimated by SDS-PAGE (Figure 5). 

Core enzyme was reconstituted by combining crude preparations of the (3 
and (V subunits with a polypeptide that contained a histidine tag, followed by removal of 

20 denaturant by dialysis to facilitate refolding. Holoenzyme was reconstituted by combining 
the core polymerase with purified primary a (MysA) from M. tuberculosis. The core/ 
holoenzyme, containing a 6xHis tag on the a subunit, was purified from free subunits and 
subassemblies by batch elution from Ni^-NTA followed by elution from ion exchange 
chromatography (Figure 6). 
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To demonstrate that reconstituted M. tuberculosis RNA polymerase possesses 
transcriptional activity, in vitro transcription reactions were performed using MysA- 
containing recombinant holoenzyme and templates containing two known MysA-dependent 
promoters (T101 and T125, which correspond to strong and weak mycobacterial promoters, 
5 respectively). The transcriptional activity of the reconstituted holoenzyme was more than 
two-fold higher when T101 was used as template as compared with T125 (Figure 7), 
suggesting that the rate of transcription was related to both the strength of the promoter 
and to the particular a factor used. 

These experiments demonstrated that M. tuberculosis a subunit produced 
10 according to the invention can be purified and reconstituted into enzymatically active RNA 
polymerase holoenzyme that exhibits specificity for M tuberculosis promoters. 



Example 3: Hi gh Throughput Screens For Inhibitors of Af. Tuberculosis RNA 
Polymerase and or Subunit 



15 



High-throughput screens for anti-tuberculosis agents may be performed using 
either an in vitro or in vivo format. In either case, the ability of test compounds to inhibit 
M. tuberculosis RNA polymerase-driven transcription of M. tuberculosis promoters is 
tested. Typically, a strong promoter such as T101 is used for the test reactions. The 
2 0 sequence of the Tl 01 promoter is: 

T101:5' 

AGCTTGCAGATCTAGCGATCGCAGCCGACGTGATACCTGACCGTTGTTGATA 
GTGTCGGCGGCT-3 * 

25 

a) In vitro screens: 
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The following procedure is used for cell-free high-throughput screening. A 
Tomtec Quadra 96-well pipetting station is used to add the reaction components to 
polypropylene 96-well dishes. 5 jil aliquots of test compounds dissolved in DMSO (or 
DMSO alone as a control) are added to wells. This is followed by 20 (il of the RNA 
5 polymerase mixture, which consists of: 10 mM DTT, 200 mM KCl, 10 mM Mg*\ 1.5 jaM 
bovine serum albumin, and 0.25 ^g reconstituted RNA polymerase. After allowing the test 
compound to interact with the RNA polymerase, 25 ^1 of the DNA/NTP mixture is added, 
containing: 1 jag template DNA (see above), 4 p.M [a- 32 P]-UTP, and 400 \xM each CTP, 
ATP, and GTP. 

10 After incubation for 30 min at 25°C, the reaction is stopped by addition of 

150 jil 10% trichloroacetic acid (TCA). After incubation at room temperature for 60 min, 
the TCA-precipitated RNA is adsorbed onto double-thick glass fiber filtermats using a 
Tomtec cell harvester. The wells of the microtiter plate and the filter are washed twice 
with 5% TCA and bound radioactivity is determined usinjg a Wallac microbeta 1450 

15 scintillation counter. Inhibitory activity due to the test compound is 

calculated according to the formula: 

( C P m positive control " CP™ 

% inhibition = - x \qq 

C P m pos<ovc control 

20 

where cpm posilivcconlrel represents the average of the cpm in wells that received DMSO alone, 
and cpm^pfc represents the cpm in the well that received test compound. Compounds that 
cause at least 50% inhibition are scored as positive "hits" in this assay. 

As an additional control, rifampicin is used at a concentration of 30 nM, 
25 which results in a 50-75% inhibition of transcription in this assay. 



)CID:<WO 9824891A1 IB: 



SUBSTITUTE SHEET (RULE 26) 



WO 98/24891 PCT/US97/22216 
' 21 

b) In vivo screen: 

M tuberculosis RNA polymerase subunits (a, p, p\ and a particular a 
subunit) are expressed in E. coli under the control of reguiatable promoters by transforming 
E. coli with appropriate plasmids. If the cr* subunit is expressed, a DNA sequence 
5 comprising the T101 promoter is also introduced into the cells to serve as a template for 
M. tuberculosis-specific transcription. 

In one embodiment, the T101 promoter sequence is linked to a DNA 
sequence encoding the xylE gene product, catechol 2, 3-dioxygenase (CDO). When 
expressed in the E. coli cell, CDO converts catechol to 2-hydroxymuconic semialdehyde, 
10 which has a bright yellow color (having an absorbance maximum at 375 nm) that is easily 
detected in whole cells or in crude extracts. The substrate for this enzyme is a small 
aromatic molecule that easily enters the bacterial cytoplasm and does not adversely affect 
cell viability. 

In a high-throughput format, aliquots of bacterial cultures are incubated in 
15 the absence or presence of test compounds, and CDO activity is monitored by measuring 
absorbance at 375 nm following addition of catechol. 

c) Specificity: 

Compounds that score as positive in either the in vitro or in vivo assays 
20 described above are then tested for their effect on human RNA polymerase II. Those 
compounds which do not significantly inhibit human RNA polymerase ILwill be further 
developed as potential anti-tuberculosis agents. 
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1 . An isolated, purified DNA encoding M. tuberculosis RNA polymerase a subunit. 

5 2. A DNA as defined in claim 1 , wherein said DNA has a sequence selected from the 
group consisting of the sequence shown in Figure 3, sequence-conservative variants thereof, 
and function-conservative variants thereof. 

3. A DNA vector comprising the nucleic acid sequence of claim 2 operably linked to 
10 a transcription regulatory element. 

4. A cell comprising a DNA vector as defined in claim 3, wherein said cell is selected 
from the group consisting of bacterial, fungal, plant, insect, and mammalian cells. 

5. A cell as defined in claim 4, wherein said cell is a bacterial cell. 

6. An isolated purified polypeptide comprising a polypeptide encoded by a DNA as 
defined in claim 2. 

7. A method for high-throughput screening to detect inhibitors of M. tuberculosis RNA 
polymerase, said method comprising: 

a) providing a mixture comprising 

(i) purified M. tuberculosis RNA polymerase, 

(ii) a DNA template encoding a promoter sequence that is 
recognized by M. tuberculosis RNA polymerase; 
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b) incubating said mixture in the presence of test compounds to form 
test samples, and in the absence of test compounds to form control samples, wherein said 
incubating is performed under conditions that result in RNA synthesis in the control 
samples; 

c) measuring RNA synthesis directed by said M. tuberculosis-recognized 
promoter in said test and control samples; and 

d) comparing the RNA synthesis detected in step (c) between said test 
and control samples; 

wherein an RNA polymerase inhibitor is a test compound that causes a 
reduction in RNA synthesis measured in said test sample relative to RNA synthesis 
measured in said control sample. 

8. A method as defined in claim 7, wherein said purified M tuberculosis RNA 
polymerase comprises recombinant subunits which are reconstituted to form an 
enzymatically active holoenzyme. 

9. A method for high-throughput screening to detect inhibitors of M tuberculosis RNA 
polymerase, said method comprising: 

a) providing a non-mycobacterial bacterial strain that 

(0 has been transformed with a DNA template encoding 

a promoter sequence that is recognized by M. tuberculosis RNA polymerase, and 

(») expresses enzymatically active M tuberculosis RNA 

polymerase, wherein said polymerase comprises a, p, p\ and one of several a subunits; 
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b) incubating the bacterial strain of (a) in the presence of test 
compounds to form test samples, and in the absence of test compounds to form control 
samples; 

c) measuring RNA synthesis directed by said M. tuberculosis-recognized 
promoter in the test and control samples; and 

d) comparing the RNA synthesis detected in step (c) between the test 
and control samples; 

wherein an RNA polymerase inhibitor is a test compound that causes a 
reduction in RNA synthesis measured in said test sample relative to RNA synthesis 
measured in said control sample. 
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